Fault-tolerant disk storage and file systems using reflective memory

Author

  • Nicos Vekiarides
Abstract

Most replicated storage and file systems take either a specialized hardware approach or a software-oriented approach to fault tolerance. This paper describes a fault-tolerant disk storage and file system that falls in between the hardware and software categories. The system uses Reflective Memory to interconnect an array of standard computers comprising a massively parallel system. This architecture provides the basis for high-availability replicated file and storage systems with the performance and low overhead expected from specialized hardware while offering the modularity and scalability of a distributed system. In this paper, we describe the implementation of the fault-tolerant file and storage system to run large-scale I/O-intensive applications, such as emulation of a stable storage DASD subsystem. Preliminary performance measurements indicate that selectively broadcasting regions of Reflective Memory allows for virtually no overhead over conventional systems for supporting replicated, distributed storage and file services.

Similar resources

Fault Tolerant File Models for MPI-IO Parallel File Systems

Abstract. Parallelism in file systems is obtained by using several independent server nodes supporting one or more secondary storage devices. This approach increases the performance and scalability of the system, but a fault in one single node can make the whole system fail. In order to avoid this problem, data must be stored using some kind of redundant technique, so that it can be recovered i...

On Using Reliable Network RAM in Networks of Workstations

File systems and databases usually make several synchronous disk write accesses in order to make sure that the disk always has a consistent view of their data, and that data can be recovered in the case of a system crash. Since synchronous disk operations are slow, some systems choose to employ asynchronous disk write operations, at the cost of low reliability: in case of a system crash all dat...

The Design and Verification of the Rio File Cache

Today's file systems are limited in speed and reliability by memory's vulnerability to operating system crashes. Because memory is viewed as unsafe, systems periodically write modified file data back to disk. These extra disk writes lower system performance and the delay period before data is safe lowers reliability. The goal of the Rio (RAM I/O) file cache is to make ordinary main memory safe...

End-to-end Data Integrity for File Systems: A ZFS Case Study

We present a study of the effects of disk and memory corruption on file system data integrity. Our analysis focuses on Sun’s ZFS, a modern commercial offering with numerous reliability mechanisms. Through careful and thorough fault injection, we show that ZFS is robust to a wide range of disk faults. We further demonstrate that ZFS is less resilient to memory corruption, which can lead to corru...

Design of Fault-Tolerant Large-Scale VOD Servers: With Emphasis on High-Performance and Low-Cost

Recent technological advances in digital signal processing, data compression techniques, and high-speed communication networks have made Video-on-Demand (VOD) servers feasible. A challenging task in such systems is servicing multiple clients simultaneously while satisfying real-time requirements of continuous delivery of objects at specified rates. To accomplish these tasks and realize economi...

Publication date: 1995